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DESCRIPTION 
METHOD FOR ISOLATING SATELLITE SEQUENCES 



Technical Field 

The present invention relates to a method for isolating satellite 
sequences. Satellite sequences, which are useful as population 
genetic markers, can be used as markers for genetic linkage analyses, 
etc . Therefore , efficient technique for isolating them is an important 
subject of research in the field of genome analysis. 

Background Art 

It has been known that eukaryotic genomes contain repetitive 
sequences consisting of similar nucleotide sequence repeats. The 
first discovered repetitive sequence has a repeat unit consisting 
of satellite DNA, which is a long sequence ranging from several hundred 
to several thousand bases, and is called a satellite sequence 
(Bioscience, 27: 790-796, 1977). Repetitive sequences consisting of 
shorter nucleotide sequence units were later identified. They have 
been named, depending on the size of their repeat unit, microsatellite 
sequences (Am. J. Hum. Genet., 4: 397-401, 1989) when they have a 
repeat unit of 2 to 5 bases (Nucleic Acids Res. 9: 5931-5947, 1981) 
or minisatellite sequences (Nature, 314: 67-73, 1985 ) when they have 
a repeat unit of 10 to 64 bases (Nature, 295: 31-35, 1982). 
Microsatellite sequences are also called simple sequences (Nucleic 
Acids Res. 17: 6463-6471, 1989) or short tandem repeats (Am. J. Hum. 
Genet. 49: 746-756), etc. 

Use of microsatellite sequences as markers was not reported 
when they were discovered. However, after their polymorphism was 
confirmed by polymerase chain reaction (PCR) (Am. J. Hum. Genet. 4: 
397-401, 1989) , they have attracted attention of researchers on their 
use as markers in a variety of areas . Specifically, their application 
to pedigree or lineage discrimination and individual identification 
of humans, animals, and plants is advanced. Because microsatellite 
sequences are scattered throughout a genome and abound in variation, 
they are good genetic markers. Furthermore, microsatellite DNA 



polymorphism contains many polymorphic gene loci and a large number 
of alleles per gene locus- In addition, based on PGR, microsatellite 
DNA polymorphism analysis is easy to perform. The amplified products 
are easily detected as single or double bands by electrophoresis, 
which simplifies determination of individual types and facilitates 
the data processing- Therefore, microsatellite DNA markers have 
become widely used as the most effective markers for population genetics 
(J. Fish. Biol., 47: 29-55, 1995). 

For microsatellite DNA polymorphism analysis, first, a variety 
of microsatellite DNA should be isolated from individual species to 
be analyzed. In addition, the microsatellite DNA regions should be 
amplified by PGR to detect the microsatellite polymorphism. 
Appropriate primers are needed for PGR. This means that, to perform 
microsatellite DNA polymorphism analysis, efficient methods for 
isolating microsatellite DNA from the species and for designing PGR 
primers capable of amplifying the microsatellite region are required. 

The inventors have already reported a method that is expected 
to allow efficient isolation of microsatellites from poultry, for 
which the microsatellite polymorphism analysis is not advanced ( Jpn. 
Poult. Sci., 33: 292-299, 1996). This method is an improvement on 
a conventional method of microsatellite sequence isolation that 
consists of a series of manipulations: fragmentation of a genome with 
restriction enzymes, insertion of the fragments into a vector, 
extension reaction with (TG)„ primers, and then cloning (Proc. Natl. 
Acad. Sci. USA 89: 3419-3423, 1992). Namely , isolation of unknown 
microsatellite sequences was achieved by choosing a vector with high 
transformation efficiencies, by specifically digesting 
single-stranded DNA with mung bean nuclease, or by removing bacterial 
DNA and RNA with DNase I and RNase A. In chicken, it is considered 
difficult to efficiently isolate microsatellite sequences, because 
the number of microsatellite sequences is small. The above method 
could achieve, by calculation, six times more efficient isolation 
than the known method that was used in an attempt to isolate chicken 
microsatellite sequences (Poultry Science, 74 : 1855-1874 , 1995). 

However, even by this method, it was impossible to prevent 
problematic clones from contaminating the microsatellite DNA clones 
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obtained. Namely, a high rate of duplicate clones was found among 
the clones obtained, suggesting bias in the constitution of clones 
as well as low efficiency. 

Microsatellite sequences are required to be isolated according 
5 to each species. There are few species for which the isolation of 
microsatellite sequences is in progress, and it is still necessary 
to isolate microsatellite sequences f rom many species • However, there 
are a number of species-specific problems, for example, low frequency 
of microsatellite sequences in the chicken genome. Therefore^ it is 
10 useful to provide a novel technique for isolating microsatellite 
sequences, also because it will enable selection of a variety of 
approaches . 

Disclosure of the Invention 
15 An objective of the present invention is to provide a method 

for isolating satellite sequences with high isolation efficiency. 
The present invention provides a highly efficient isolation method 
that any known technique cannot attain. 

The present inventors presumed that one of the major causes 

2 0 that generate duplicate clones during isolation of satellite sequences 

by known techniques might come from the manipulation for fragmentation 
of the genomic DNA. in other words, the possibility that, when digested 
with restriction enzyme, the genomic library is biased due to the 
sequence-specific cleavage cannot be excluded. 
25 Therefore, the inventors attempted to utilize a nucleotide 

sequence-independent cleavage method in which more arbitrary cleavage 
of the genomic DNA is expected. As a result, they found that cleavage 
of genomic DNA is possible with nucleotide sequence-independent 
DNA-digesting enzymes or by a physical action. However, since most 

3 0 of the physically cleaved genomic DNA fragments are not phosphorylated 

at their ends, they are not allowed to be efficiently incorporated 
into vectors using an enzyme. In addition, since physically cleaved 
fragments have an irregularly protruding strand at their cleavage 
site, they cannot be ligated. The inventors have overcome these 
35 problems by using several enzymatic treatments, and established a 
method to obtain more homogeneous genomic library. Furthermore, they 
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have confirmed that the library thus obtained enables efficient 
isolation of satellite sequences, thereby completing the present 
invention. As used herein, the term "satellite sequences" indicates, 
unless otherwise mentioned, any repeat sequences which include 
5 microsatellite sequences and minisatellite sequences . Also as used 
herein, "microsatellite sequences" and "minisatellite sequences" 
indicate repeat sequences consisting of repeat units of 2 to 5 bp 
and 10 to 64 bp, respectively. Thus, the present invention relates 
to the following methods for isolating satellite sequences - 
10 (1) An isolation method for satellite sequences, wherein a genomic 

DNA is cleaved by a nucleotide sequence-independent method, the 

isolation method comprising: 

a) obtaining randomly cleaved fragments of the genomic DNA and 

b) selecting, from the fragments obtained in a) , fragments comprising 
15 the satellite sequences. 

(2) The isolation method of (1)/ wherein the nucleotide 
sequence-independent method is a physical cleavage method or an 
enzymatic cleavage method. 

(3) The isolation method of (2) , wherein the physical cleavage method 
20 is sonication. 

(4) The isolation method of (3), wherein the ends of the genomic DNA 
that have been fragmented by sonication are to be blunted. 

(5) The isolation method of (4), wherein the ends are to be blunted 
with DNA polymerase having single strand-specific endonuclease 

25 activity and 3 ' ^ 5' exonuclease activity. 

(6) The isolation method of (2), wherein a nucleotide 
sequence-independent endonuclease is used in the enzymatic 
cleavage method . 

(7) The isolation method of (6), wherein the nucleotide 
30 sequence- independent endonuclease is DNase I. 

(8) The isolation method of (1) , wherein the satellite sequences are 
microsatellite sequences. 

(9) Use of satellite sequences isolated by the isolation method of 
any one of (1) to (8) as DNA markers. 

3 5 In the present invention, it is critical to cleave a genomic 

DNAusing a nucleotide sequence-independent cleavage method that gives 
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random fragments. Cleavage of a genomic DNA using a nucleotide 
sequence-independent cleavage method refers to cleaving DNA without 
dependence on structural characteristics (i.e., nucleotide sequences ) 
of the DNA* Therefore, neither chemical reactions in which DNA is 
5 cleaved in a nucleotide sequence-dependent manner nor digestion with 
restriction enzymes constitutes the cleavage method of the present 
invention. The genomic DNA to which the treatment by these actions 
is to be applied can be prepared by known methods . Namely, cells are 
lysed enzymatically with protease, and are sub jected to DNA extraction 

10 using an appropriate solvent. 

For cleavage of a genomic DNA using a nucleotide 
sequence-independent cleavage method, any of cleavage methods, for 
example, those based on physical actions and those based on the action 
of DNA-digesting enzymes that do not recognize the nucleotide sequence , 

15 can be used. Physical actions include sonication and agitation. On 
the other hand, as DNA-digesting enzymes that do not recognize the 
nucleotide sequence, DNase I and the like are known. Any of these 
actions can be applied to the present invention. Above all, treatment 
by sonication is one of the most desirable manipulations with excellent 

20 reproducibility. By sonication, a large quantity of fragments with 
uniform lengths can be obtained in a short time. In contrast , partial 
digestion of a genomic DNA with DNase I requires strict setting of 
conditions for reproducibly obtaining fragments with uniform lengths . 
In addition, this method often requires troublesome manipulations. 

25 As to the extent of cleavage of a genomic DNA, those skilled 

in the art can empirically set the conditions that will permit efficient 
production of fragments having the optimal size for the satellite 
sequence of interest . For example, if isolation of the microsatellite 
sequences is intended, such conditions that give a library of fragments 

3 0 of 3 00 to 900 bp are selected. Considering the operation of ligating 
the cleaved fragments into a vector, it is desirable to use conditions 
under which the fragments the majority of which have the size of 300 
to 500 bp or 500 to 800 bp are generated. This is because, if fragments 
contain integral multiples of smaller fragments, such fragments are 

3 5 indistinguishable from the ligated products of more than one smaller 
fragment. For instance, when sonication at 20 kHz with amplitude of 
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10 is performed on ice, fragments of 300 to 500 bp can be efficiently 
obtained if the sonication of 1-minute duration is performed 1 to 
5 times . Since a prolonged sonication may heat the sample and denature 
DNA, it is desirable to repeat the sonication with shorter time duration* 
5 The genomic DNA cleaved by physical actions such as sonication 

tends to exhibit poor ligation efficiency, which prevents efficient 
incorporation of the DNA into vectors • For this reason , it is des irable 
to blunt the ends of the DNA fragments. In the present invention, 
any of the known blunting methods can be used : removal of the protruding 

10 ends with single strand-specific endonuclease, and further synthesis 
of the complementary strand at the 3 ' recessed ends with DNA polymerase 
to ensure the blunting. Specifically, mung bean nuclease and SI 
nuclease can be used as the single strand-specific endonucleases . 
On the other hand, DNA polymerase having 3'—^ 5' exonuclease activity 

15 can be used as the DNA polymerase. Such DNA polymerases include T4 
DNA polymerase, pfu polymerase, and KOD polymerase. Mung bean nuclease, 
which is used in the Example, is a single strand-specific endonuclease 
and achieves blunting by digesting the protruding portion of the single 
strands of the DNA. In the Example, the action of T4 DNA polymerase 

2 0 is further utilized to ensure the blunting. T4 DNA polymerase digests 

the protrusion at the 3 ' ends by its strong 3 ' 5 ' exonuclease activity , 
and at the same time, it can synthesize complementary strands at the 
3' recessed ends, thereby giving blunt ends. However, T4 DNA 
polymerase cannot blunt a recessed 3' end with a phosphate group. 
25 Therefore, it is desirable to use mung bean nuclease together. In 
such a case, treatment with mung bean nuclease cleaves gaps generated 
in the double-stranded DNA by sonication, as well as blunts the ends. 
As a result , fragments with lengths shorter than expected may be produced . 
In order to obtain a library free from shorter fragments , it is desirable 

3 0 to provide a step of isolating the fragments of expected size after 

treating with mung bean nuclease. Fragments of expected sizes can 
be isolated by agarose gel electrophoresis or by gel filtration. The 
blunt-ended genomic DNA fragments can be directly ligated into vectors . 
Alternatively, they can be phosphorylated at their 5' ends to improve 
35 the efficiency of ligation. T4 polynucleotide kinase can be used for 
phosphorylation. 
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The genomic DNA fragments with phosphorylated 5' ends can be 
self-ligated during the ligation into the vector. To prevent the 
self -ligation, the concentration of the fragments should be reduced, 
while excess concentration of the vector is employed so as to increase 
5 the opportunities for the fragments to be ligated into the vector. 
For the ligation reaction, T4 DNA ligase works in the ligation buffer 
and ligation efficiency can be further improved by carrying out the 
ligation in the presence of the restriction enzyme that have been 
used to cleave the vector. For example, when pCR-ScriptSK ( + ) is used 
10 as the vector, restriction enzyme Srfl can help keeping the vector 
in a linear state. Since Srfl does not act on the vector into which 
an insert is ligated, one can expect the ligation efficiency to be 
improved. 

In the present invention, not only physical actions but also 

15 nucleotide sequence-independent DNA-digesting enzymes can be used 
for fragmentation of DNA. These enzymes include, for example, DNase 
I, etc. For the enzymatic treatment with DNase I, the conditions that 
allow the uniform action of the enzyme on the whole genome and that 
also allow production of the fragments with expected sizes should 

20 be empirically provided. For example, in order to maintain the 
enzymatic action in a uniform state, genomic DNA with highest purity 
should be used after nuclear proteins are removed. Also, the enzymatic 
reaction should be done as quickly as possible in order to avoid 
producing a large number of small fragments. In addition, conditions 

25 such as reaction time and temperature should be accurately controlled 
in order to maintain high reproducibility. After the enzymatic 
reaction is completed, the reaction is stopped by heating or other 
means. The nucleic acid component is recovered from the reaction, 
and, if necessary, particular fragments are extracted, followed by 

30 incorporation of the fragments into the vectors as described above. 
The fragments of the genomic DNA digested with the enzyme are blunt-ended 
and phosphorylated, which allows them to be directly used for ligation. 

The vectors used in the present invention are not specially 
restricted- Specifically, known vectors such as pCR-ScriptSK( + ) , 

35 pBluescriptKS (+ ) (both are manufactured by Stratagene), and pUC18 
can be used - After an appropriate host is transformed with the vector 
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including the insert and is grown, the DNA is recovered to make a 
genomic library. When pCR-ScriptSK(+) is used for transformation, 
it may be advantageous to use the host strains having excellent 
transformation efficiencies, such as .E. coii (competent cell ) XLl -Blue 
5 MRF' , XL2-Blue MRF ' / and TGI (all are manufactured by Stratagene) . 

Using the genomic library thus constructed, transf ormants 
containing the satellite sequences of interest can be cloned. 
Screening for the satellite sequences can be done based on the colony 
hybridization using a probe for a satellite sequence (Can. J. Fish. 

10 Biol. 51: 1959-1996, 1994) or on the primer extension (Proc. Natl. 
Acad. Sci. USA, 89: 3419-3423, 1992). In colony hybridization, 
colonies of the transformed cells as described above are transferred 
onto the filter, and hybridized with a probe having, for example, 
a repeat sequence of ( GT )^ . The nucleotide sequence of oligonucleotide 

15 that constitutes the probe is properly provided based on the satellite 
sequence of interest. Positive colonies are separated to isolate the 
clones containing the satellite sequence. 

In primer extension, on the other hand, the genomic library 
is recovered as single-stranded DNA. For example, if pCR-ScriptSK( + ) 

20 is used as the vector, the transformed coli cells can be infected 
with helper phage VGS-Ml 3 or the like to recover the inserts in a 
single-stranded form as recombinant phages. Since DNA packaged in 
the phage is protected against enzymatic action, RNA and DNA derived 
from E. coii can be enzymatically degraded when the DNA of interest 

25 is recovered, thereby achieving more efficient cloning . 

To the single-stranded plasmid DNA isolated from the phage, 
a primer specific to the satellite sequence is annealed, and then, 
DNA polymerase is applied. For example, repetitive sequences of 
(dA-dT)„ are generally (0.3% of the entire genomic DNA) found in the 

30 mammalian microsatellite sequences . In humans , such combinations as 
( dC-dA/dT-dG ) , and ( dC-dT/dA-dG ) ^ are also frequently observed . Based 
on such information, one can design primers that comprise a nucleotide 
sequence complementary to a satellite sequence and that anneal to 
the satellite sequence. It is advantageous to phosphorylate the 5' 

35 end of the primer beforehand. Vectors that contain a microsatellite 
sequence acquire the primer at this site and a double-stranded DNA 
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is synthesized, while those to which the primer does not anneal remain 
s ingle-stranded • Ligation is performed after the synthesis of 
complementary strand with DNA polymerase to complete the 
double-stranded circular DNA, and then the single-stranded DNA is 
5 digested with mung bean nuclease or the like/ Consequently, only the 
vectors containing the satellite sequences remain. These vectors can 
be recovered and used to transform cells , thereby cloning the vectors • 
After amplifying the cloned vectors containing the satellite sequences , 
one can recover the DNA and determine the sequences , thereby completing 

10 the isolation of the satellite sequences. 

When a satellite sequence is isolated together with its flanking 
regions, primers for PGR can be designed. For designing the primers, 
it is convenient to utilize a commerc ially available package of software 
for sequence analysis. For example, in the Example below, a primer 

15 designing software Primer Premier (Premier Biosoft International ) 
is used. The microsatellite sequences and minisatellite sequences 
isolated are useful as fingerprint markers for intraspecies analysis 
or as pedigree (or lineage) markers. If any satellite sequence 
obtained in the present invention exhibits polymorphism, such a 

20 nucleotide sequence can be used as an indicator for identification 
of individuals. Such a satellite sequence is called a fingerprint 
marker. Since satellite sequences used as fingerprint markers are 
inherited from the parents of an individual , analys is of the fingerprint 
markers conserved among multiple individuals indicates the probability 

25 that these individuals belong to a single pedigree. Such use of a 
satellite sequence is called a pedigree (or lineage) marker. 

Furthermore, the satellite sequences isolated in the present 
invention can also be used as genetic linkage markers, in addition 
to the use as fingerprint markers and pedigree (or lineage) markers. 

30 In the present invention, a satellite marker utilized as an indicator 
represented by the uses exemplified above is called a DNA marker. 

Brief Description of the Drawings 

Figure 1 shows the coloring image of a dot-blot assay with a 
35 microsatellite sequence-specific probe ((CA)io oligonucleotide). 
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Best Mode for Carrying out the Invention 

The present invention is described more specifically in the 
following examples. 

5 (1) Preparation of DNA fragments from black abalone 

Black abalone {Haliotis discus discus) DNA was isolated from 
the foot muscle according to the TNES-Urea method (Fisheries Sci/, 
62: 723-726, 1996). However, 4 M urea was used herein. The genomic 
DNA solution (20 yig/ml, 55 0 ^il) was sonicated (20 kHz, Amplitude 10, 

10 for one minute, five times) with cooling to fragment the DNA. The 
reaction mixture (52 ^1) containing the substrate DNA (about 10 fig) / 
30 mM CHaCOONa (pH 4.6), 50 mM NaCl, 1 mM (CH3COO)2Zn, 5% glycerol, 
and 60 units of mung bean nuclease (Toyobo) was incubated for 1 hour 
at 37 ""C to cleave the gap portions of the DNA fragments and to blunt 

15 the protruding ends. By electrophoresis on a 1^ 2% agarose gel , the 
DNA fragments of 300 to 500 bp were recovered. The reaction mixture 
(100 fil) containing the recovered substrate DNA, 2 0 \xM dNTP, 50 mM 
Tris/HCl (pH8.5), 7mMMgCl2, 15 mM (NH4 )2S04, 10 mM 2-mercaptoethanol , 
0.1 mM EDTA, and 10 units of T4 DNA polymerase (Toyobo) was incubated 

20 for 1 hour at 37°C to blunt the protruding ends of the DNA fragments. 
Further, the reaction mixture (50 containing the substrate DNA, 

0.2mMrATP, 50mMTris/HCl (pH7.6), lOmMMgCla, 10 mM 2-mercaptoethanol , 
and 10 units of T4 polynucleotide kinase (Toyobo) was incubated for 
1 hour at 37''C to phosphorylate the 5' ends of the DNA fragments. 

25 

(2) Ligation of genomic DNA fragments into plasmid vectors 

The reaction mixture (10 jil) containing pCR-Script SK( + ) vector 
(Stratagene) (about 1 ^g), 2 5 mM Tr is /Acetate (pH 7.6), 100 mM KOAc, 
10 mM MgOAc, 0.5 mM 2-mercaptoethanol, 10 fxg/ml BSA, and 10 units 

30 of restriction enzyme Srfl (Stratagene) was incubated for 1 hour or 
more at 37*^C, to cleave the vector at the Srfl site. The reaction 
mixture (50>il) containing the vector solution prepared, the substrate 
DNA (about 3 \iq) , 10 units of Srfl, 0.5 mM rATP, 66 mM Tris/HCl (pH 
7.6) , 6,6 mM MgCls, 10 mM dithiothreitol , and 20 units of T4 DNA ligase 

35 (Toyobo) was incubated overnight at 24 °C to ligate the DNA fragments 
into the vector. After inactivating the DNA ligase by heating at 65''C 



for 1 5 minutes , 1 0 units of Srf I was further added • The react ionmixture 
was incubated for 1 hour at 37°C to digest the self-ligated vector. 

(3 1 Preparation of single-stranded DNA 

To each of five culturing tubes, 100 \xl of XL2-Blue MRF ' 
ultracompetent cells (Epicurian coli ultracompetent cells; 
Stratagene) was added, and then, the ligated vector (about 200 ng) 
was added, followed by transformation according to the attached 
instruction manual. After 900 ^1 of the NZY medium (Maniatis, 
Molecular Cloning: A Laboratory Manual, 2nd ed. , Cold Spring Harbor 
Laboratory Press, New York, 1989) was added to each of the tubes, 
the cells were cultured for one hour at 3 7 ° C with shaking . Subsequently , 
ampicillin was added to the final concentration of 50 pig/ml . The cells 
were cultured for another one hour at 37 ""C with shaking to select 
the trans formed K. coli. To each of the tubes , about 10'°pfu of VCS-M13 
helper phage was added. The cells were allowed, to stand for 2 0 minutes 
at 3 7 ""C, followed by addition of kanamycin at the final concentration 
of 70 ^ig/ml. The cells were cultured for 1 hour at 37 ""C with shaking 
to select the^. coli infected with the helper phage. All the cultures 
were put together and added to 100 ml of the Terrific medium (Maniatis, 
Molecular Cloning: A Laboratory Manual, 2nd ed. , Cold Spring Harbor 
Laboratory Press, New York, 1989) containing ampicillin and kanamycin. 
The cells were cultured for 14 hours at 37°C with shaking. The Terrific 
medium was centrifuged at 14,000 rpm for 10 minutes at O'^C, and the 
supernatant was recovered and filtered twice through a 45 ^m filter. 
After precipitated twice with PEG according to the standard method 
(Muramatsu, Lab Manual of Genetic Engineering, 3rded. , Maruzen, Tokyo, 
1996, pp. 51-55), the DNA and RNA derived from E. coli were digested 
according to a known method (Jpn. Poult. Sci. 33: 292-299, 1996). 
Subsequently, PEG precipitation, phenol extraction, and ethanol 
precipitation were performed to purify the single-stranded DNA. 

(4) Selection of (CA) ^ -positive plasmids 

Primer extension was performed using the (CA)i2 oligonucleotide 
to select the plasmid DNA into which DNA fragments with (TG/CA) repeats 
had been inserted. The solution (98 iliI) containing the single-stranded 
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DNA (about 3 \ig) , 0.2 mM dNTP, 20 mM Tris/HCl (pH 8.8), 10 mM KCl, 
10 mM {NH4)2S04, 2 mM MgSO^, 100 \xg/ml BSA/0.1% Triton X-100, and 100 
pmol (CA)i2 oligonucleotide was pre-heated at 72 ""C for 10 minutes. 
Five units of Pfu DNA polymerase ( Stratagene ) was added to the solution , 
5 which was then overlaid with mineral oil and incubated for 3 0 minutes 
at 72*^C. Subsequently, the generated double-stranded DNA was 
recovered by phenol extraction and ethanol precipitation and was 
dissolved in 10 \il of sterilized water. After circularizing the 
double-stranded DNA by using the ligation kit (Ligation high / Toyobo ) , 

10 the DNA ligase was inactivated by heat treatment ( 65 °C f or 15 minutes ) . 
The reaction mixture (100 containing this solution, 30mMCH3COONa 
(pH 4.6), 50 mM NaCl, 1 mM (CH3COO)2Zn, 5% glycerol, and 30 units of 
mung bean nuclease (Toyobo) was incubated for 2 hours or more at 3 7*^C 
to digest the single-stranded DNA which had not been primer-extended. 

1 5 DNA was recovered by phenol/chloroform extraction and ethanol 
precipitation, and was dissolved in 60 fil of TE buffer. 

With the recovered DNA (equivalent to 100 ^ig of the 
single-stranded DNA) , the XL2-Blue MRF ' Ultracompetent cells were 
transformed. The transformed cells were spread onto a 2xYT agar medium 

20 containing 50 ptg/ml ampicillin (Maniatis , Molecular Cloning: A 
Laboratory Manual, 2nd ed. , Cold Spring Harbor Laboratory Press , New 
York, 1989), and cultured overnight at 3 7 ""C. Single colonies on the 
culture plate were picked up at random, and cultured overnight at 
3 7°C with shaking in 2 ml of 2xYT medium (Maniatis, Molecular Cloning: 

25 A Laboratory Manual, 2nd ed. , Cold Spring Harbor Laboratory Press, 
New York, 1989) containing ampicillin at the final concentration of 
5 0 [ig/ml. The plasmid was extracted by the alkaline method according 
to the standard procedure (Muramatsu, Lab Manual of Genetic Engineering, 
3rd ed. , Maruzen, Tokyo, 1996, pp. 51-55), and dissolved in 50 ^il of 

3 0 TE buffer. 

The purified plasmid DNA (equivalent to 4 ng of the 
single-stranded DNA) was alkaline-denatured with 0 . 5 NNaOH and blotted 
onto a nylon membrane (No. 1209299, Boehringer Mannheim) . Themembrane 
was baked at 120''C for 30 minutes. Using (CA)io oligonucleotide 
35 digoxigenin-labeled with the DIG oligonucleotide labeling kit (DIG 
Oligonucleotide Tailing Kit, Boehringer Mannheim) as a probe, and 
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using the DIG nucleic acid detection kit (DIG Nucleic Acid Detection 
Kit, Boehringer Mannheim), (CA)„-positive plasmids were detected 
according to the manufacturer's instruction. Part of the results is 
shown in Figure 1. 

5 

(5) Cycle sequencing 

For the (CA)„-positive clones, nucleotide sequences were 
determined using cycle sequencing method. The KS primer labeled with 
Cy5 at its 5' end, the Reverse primer, and the sequencing kit 

10 ( ThermoSequence fluorescent labeled primer cycle sequencing kit with 
7-deaza-dGPT; Amersham) were used for the sequencing reaction. The 
fluorescent DNA sequencer (ALFexpress, Pharmacia) was used for 
sequencing. Based on the nucleotide sequence data thus obtained, 
optimal primers were designed for the region encompassing the CA repeats 

15 using software for primer design (Primer Premier; Premier Biosoft 
International). The nucleotide sequences of the microsatellite 
sequences isolated from the black abalone are shown below. Repetitive 
units are shown in the parentheses so that they are easily recognized. 
The actual sequences are as shown in the sequence listing (the numbers 

20 correspond to the SEQ ID NOs ) . For the regions 1 to 24, PCR primers 
can be designed, and polymorphism was identified for the nine regions 
(1, 3, 8, 10, 13, 15, 16, 19, and 24). 

1. (CA)4i 

2. (GACT)2(CTCA)7(CA)2CT(CA)9 
25 3. C5CAC2(CA)i2TA(CA)8 

4. (CA), 

5. (CA),, 

6. (GA)2CAGA(CA)5 

7. CA3GA2C3A3(CA)5 

3 0 8. (CGCA)9TGCAC2(CA)2 

9. (CT)3(CA2)3(CA)5 

10. (CA)25 

11. CACT(CA)i6TACA 

12. CA3(CA)2T(CA)4 
35 13. (CA)3o 

14. CA2GCA2C(CA)25 
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15. (CA)2CT(CA)i3(CGCA)i,(CA), 

16. (CA)e(CG)4 

17. (CA),(CG)4 
18- (CA)5 

5 19. (CA),, 

20. TACATA(CA)i2 

21. (CA)2CA3(CA)6 

22. (CA)2AC(CA)3AC(CAC)2(CA)5 

23. (GA)e(TGCA)2 
10 24. (CA)34 

25. (CA),(CGCA)2CGA2(CGCA)2A2(CA)2(CG)2 

26. (CA)e 

27. CAC8(CA)9C4 
28v (CA),(GA)2 

15 29. (CA)26 

30. (CA)25 

31. CAG(CA)5TACA 

32. ( CA3 ) 3 ( CA ) ,CA3 ( CA ) 12G2CA ( CG2 ) 3 

About 85% (41 clones) of the (TG/CA) ^-concentrated library 
20 V constructed based on the sequence- independent cleavage method using 

sonication was found to be (CA) ^-positive clones ( Figure 1) . Out of 

them, 32 ( CA)„-positive clones were selected at random and sequenced. 

AS a result, the PGR primers can be designed for 24 clones (72%). 

For the rest of the clones (9 clones), it was impossible to design 
25 the primers because some repeat regions were adjacent to the ligation 

site on the vector, because some regions were ligated into the vector 

in the middle of the repeat, or because some repeats were scattered 

throughout the DNA fragment. 

30 Industrial Applicability 

The present invention makes it possible to efficiently isolate 
satellite sequences such as microsatellite sequences. Since the 
present invention utilizes a sequence-independent cleavage method 
for fragmentation of a genomic DNA, it is not influenced by the 

35 nucleotide sequence, and enables producing unbiased libraries. 
Satellite sequences isolated from the unbiased library contain reduced 
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redundancy. Thus, the present invention enables efficiently 
isolating satellite sequences. 

For instance, the present inventors have succeeded in obtaining 
a (TG/CA) ^.-concentrated library comprising 32 clones by a single 
5 manipulation from black abalone from which no microsatellite sequences 
has been isolated previously. The clones obtained from this library 
are free from redundancy. It can be concluded that, compared with 
the previous method that produced many duplicated clones for chicken, 
the present method has increased the efficiency of the isolation. 

10 The isolated clones can serve for designing PGR primers as well as 
show reduced sequence redundancy, which is another striking effect 
of the present invention. For instance, calculated from the result 
of the example, at least 60% (0.85 x 24/32) of the clones cultured 
on the plate can be used to design PGR primers. These results reveal 

15 that the methods of the present invention for isolating microsatellite 
sequences are excellent methods that can be widely applied. 

Furthermore, the satellite sequences isolated by the methods 
of the present invention can be utilized as DNA markers, such as 
fingerprint markers, pedigree (or lineage) markers, and genetic 

20 linkage markers. 



CLAIMS 



An isolation method for satellite sequences^ wherein a genomic 
DNA is cleaved by a nucleotide sequence-independent method^ the 
isolation method comprising: 

obtaining randomly cleaved fragments of the genomic DNA and 
selecting, from the fragments obtained in a) , fragments comprising 
the satellite sequences 

The isolation method of claim 1, wherein the nucleotide 
sequence-independent method is a physical cleavage method or an 
enzymatic cleavage method. 

The isolation method of claim 2, wherein the physical cleavage 
method is sonication. 

The isolation method of claim 3, wherein the ends of the genomic 
DNA that have been fragmented by sonication are to be blunted. 
The isolation method of claim 4, wherein the ends are to be blunted 
with DNA polymerase having single strand-specif ic endonuclease 
activity and 3'^ 5' exonuclease activity. 

The isolation method of claim 2, wherein a nucleotide 
sequence-independent endonuclease is used in the enzymatic cleavage 
method. 

The isolation method of claim 6, wherein the nucleotide 
sequence- independent endonuclease is DNase I. 

The isolation method of claim 1, wherein the satellite sequences 
are microsatellite sequences. 

Use of satellite sequences isolated by the isolation method of 
any one of claims 1 to 8 as DNA markers. 



ABSTRACT 



A highly homogeneous library can be obtained by cleaving a genomic 
DNA by a sequence-independent cleavage method, such as sonication, 
By selecting satellite sequences from such a library, efficiency of 
isolation is improved. Thus, an efficient method of isolating 
microsatellite sequences , which are useful as DNA markers , is provided. 
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Figure 1 
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^SEQUENCE LISTING 



<110> Takahashi, Hideaki 
Sekino, Masashi 
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<12 0> METHOD FOR ISOLATING SATELLITE SEQUENCES 



<130> 201487/1060 

<140> 09/762,867 
<141> 1999-07-01 

<150> JP 10/232153 
<151> 1998-08-18 

<150> PCT/JP99/03551 
<151> 1999-07-01 

<160> 32 



<170> Patentin Ver. 2.1 



<210> 1 
<211> 82 
<212> DNA 

<213> Haliotis discus discus 
<400> 1 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60 
cacacacaca cacacacaca ca 82 



<210> 2 
<211> 60 
<212> DNA 

<213> Haliotis discus discus 
<400> 2 

gactgactct cactcactca ctcactcact cactcacaca ctcacacaca cacacacaca 60 



<210> 3 
<211> 51 
<212> DNA 

<213> Haliotis discus discus 
<400> 3 

ccccccaccc acacacacac acacacacac acatacacac acacacacac a 



1 
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<210> 4 
<211> 14 
<212> DNA 

<213> Haliotis discus discus 
<400> 4 

cacacacaca caca 



<210> 5 
<211> 32 
<212> DNA 

<213> Haliotis discus discus 
<400> 5 

cacacacaca cacacacaca cacacacaca ca 



<210> 6 
<211> 18 
<212> DNA 

<213> Haliotis discus discus 
<400> 6 

gagacagaca cacacaca 



<210> 7 
<211> 23 
<212> DNA 

<213> Haliotis discus discus 
<400> 7 

caaagaaccc aaacacacac aca 



<210> 8 
<211> 46 
<212> DNA 

<213> Haliotis discus discus 
<400> 8 

cgcacgcacg cacgcacgca cgcacgcacg cacgcatgca cccaca 



<210> 9 



2 
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<211> 25 



<212> DNA 



<213> Haliotis discus discus 



<400> 9 



ctctctcaac aacaacacac acaca 



25 



<210> 10 
<211> 50 
<212> DNA 

<213> Haliotis discus discus 
<400> 10 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 50 

<210> 11 
<211> 40 
<212> DNA 

<213> Haliotis discus discus 



<210> 12 
<211> 17 
<212> DNA 

<213> Haliotis discus discus 
<400> 12 

caaacacatc acacaca 17 

<210> 13 
<211> 60 
<212> DNA 

<213> Haliotis discus discus 
<400> 13 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60 

<210> 14 
<211> 58 
<212> DNA 

<213> Haliotis discus discus 



<400> 11 



cactcacaca cacacacaca cacacacaca cacacataca 



40 



3 
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<400> 14 

caagcaacca cacacacaca cacacacaca cacacacaca cacacacaca cacacaca 58 



<210> 15 
<211> 88 
<212> DNA 

<213> Haliotis discus discus 
<400> 15 

cacactcaca cacacacaca cacacacaca cacgcacgca cgcacgcacg cacgcacgca 60 
cgcacgcacg cacgcacaca cacacaca 88 



<210> 16 
<211> 24 
<212> DNA 

<213> Haliotis discus discus 
<400> 16 

cacacacaca cacacacgcg cgcg 24 



<210> 17 
<211> 20 
<212> DNA 

<213> Haliotis discus discus 
<400> 17 

cacacacaca cacgcgcgcg 2 0 



<210> 18 
<211> 10 
<212> DNA 

<213> Haliotis discus discus 
<400> 18 

cacacacaca iO 



<210> 19 
<211> 52 
<212> DNA 

<213> Haliotis discus discus 
<400> 19 



4 
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cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca ca 52 



<210> 20 
<211> 30 
<212> DNA 

<213> Haliotis discus discus 
<400> 20 

tacatacaca cacacacaca cacacacaca 3 0 



<210> 21 
<211> 20 
<212> DNA 

<213> Haliotis discus discus 
<400> 21 

cacacaaaca cacacacaca 



<210> 22 
<211> 30 
<212> DNA 

<213> Haliotis discus discus 
<400> 22 

cacaaccaca caaccaccac cacacacaca ^0 



<210> 23 
<211> 24 
<212> DNA 

<213> Haliotis discus discus 
<400> 23 

cacacacaca cacacatgca tgca 



<210> 24 
<211> 68 
<212> DNA 

<213> Haliotis discus discus 
<400> 24 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60 

68 

cacacaca 



5 
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<210> 25 
<211> 44 
<212> DNA 

<213> Haliotis discus discus 
<400> 25 

cacacacaca cacacgcacg cacgaacgca cgcaaacaca cgcg 44 



<210> 26 
<211> 16 
<212> DNA 

<213> Haliotis discus discus 
<400> 26 

cacacacaca cacaca 



<210> 27 
<211> 32 
<212> DNA 

<213> Haliotis discus discus 
<400> 27 

cacccccccc cacacacaca cacacacacc cc 32 



<210> 28 
<211> 16 
<212> DNA 

<213> Haliotis discus discus 
<400> 28 

cacacacaca cagaga 



<210> 29 
<211> 52 
<212> DNA 

<213> Haliotis discus discus 



<400> 29 
cacacac 



aca cacacacaca cacacacaca cacacacaca cacacacaca ca 52 



<210> 30 
<211> 50 



6 
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<212> DNA 

<213> Haliotis discus discus 
<400> 30 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 



<210> 31 
<211> 17 
<212> DNA 

<213> Haliotis discus discus 
<400> 31 

cagcacacac acataca 



<210> 32 
<211> 61 
<212> DNA 

<213> Haliotis discus discus 
<400> 32 

caaacaaaca aacacacaca caaacacaca cacacacaca cacacacagg cacggcggcg 6 0 
^ 61 



7 
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SEQUENCE LISTING 

<llfl> National Institute of Agrobiological Resources 

<120> Method for isolating satellite sequences 

<130> MOA-004PCT 

<140> 
<141> 

<150> JP 10-232153 
<151> 1998-08-18 

<160> 32 

<170> Patentin Ver. 2.0 

<210> 1 

<211> 82 

<212> DNA 

<213> Haliotis discus discus 



<400> 1 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60 
cacacacaca cacacacaca ca 82 
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<210> 2 
<211> 60 
<212> DNA 

<213> Haliotis discus discus 
<400> 2 

gactgactct cactcactca ctcactcact cactcacaca ctcacacaca cacacacaca 60 

<210> 3 
<211> 51 
<212> DNA 

<213> Haliotis discus discus 
<400> 3 

ccccccaccc acacacacac acacacacac acatacacac acacacacac a 51 

<210> 4 
<211> 14 
<212> DNA 

<213> Haliotis discus discus 
<400> 4 

cacacacaca caca 14 



<210> 5 
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<211> 32 
<212> DNA 

<213> Haliotis discus discus 
<400> 5 

cacacacaca cacacacaca cacacacaca ca 32 

<210> 6 
<211> 18 
<212> DNA 

<213> Haliotis discus discus 
<400> 6 

gagacagaca cacacaca 18 

<210> 7 
<211> 23 
<212> DNA 

<213> Haliotis discus discus 
<400> 7 

caaagaaccc aaacacacac aca 23 

<210> 8 
<211> 46 
<212> DNA 
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<213> Haliotis discus discus 
<400> 8 

cgcacgcacg cacgcacgca cgcacgcacg cacgcatgca cccaca 

<210> 9 
<211> 25 
<212> DNA 

<213> Haliotis discus discus 
<400> 9 

ctctctcaac aacaacacac acaca 

<210> 10 
<211> 50 
<212> DNA 

<213> Haliotis discus discus 
<400> 10 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 

<210> 11 
<211> 40 
<212> DNA 

<213> Haliotis discus discus 
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<400> 11 

cactcacaca cacacacaca cacacacaca cacacataca 40 

<210> 12 
<211> 17 
<212> DNA 

<213> Haliotis discus discus 



<400> 12 

caaacacatc acacaca 

<210> 13 
<211> 60 
<212> DNA 

<213> Haliotis discus discus 
<400> 13 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60 



<210> 14 
<211> 58 
<212> DNA 

<213> Haliotis discus discus 



<400> 14 

caagcaacca cacacacaca cacacacaca cacacacaca cacacacaca cacacaca 58 
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<210> 15 
<211> 88 
<212> DNA 

<213> Haliotis discus discus 
<400> 15 

cacactcaca cacacacaca cacacacaca cacgcacgca cgcacgcacg caegcaegca 60 
cgcacgcacg cacgcacaca cacacaca 88 



<210> 16 

<211> 24 

<212> DNA 

<213> Haliotis discus discus 

<400> 16 

cacacacaca cacacacgcg cgcg 24 

<210> 17 

<211> 20 

<212> DNA 

<213> Haliotis discus discus 



<400> 17 

cacacacaca cacgcgcgcg 



20 
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<210> 18 
<211> 10 
<212> DNA 

<213> Hal iot is discus discus 
<400> 18 

cacacacaca 10 

<210> 19 
<211> 52 
<212> DNA 

<213> Haliotis discus discus 
<400> 19 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca ca 52 

<210> 20 
<211> 30 
<212> DNA 

<213> Haliotis discus discus 
<400> 20 

tacatacaca cacacacaca cacacacaca 30 



<210> 21 
<211> 20 
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<212> DNA 

<213> Haliotis discus discus 

<400> 21 

cacacaaaca cacacacaca 20 

<210> 22 

<211> 30 

<212> DNA 

<213> Haliotis discus discus ^ 

<400> 22 

cacaaccaca caaccaccac cacacacaca 30 

<210> 23 

<211> 24 

<212> DNA 

<213> Haliotis discus discus 

<400> 23 

cacacacaca cacacatgca tgca 24 

<210> 24 

<211> 68 

<212> DNA 

<213> Haliotis discus discus 
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<400> 24 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 60 
cacacaca 68 

<210> 25 
<211> 44 
<212> DNA 

<213> Haliotis discus discus 
<400> 25 

cacacacaca cacacgcacg cacgaacgca cgcaaacaca cgcg 44 



<210> 26 
<211> 16 
<212> DNA 

<213> Haliotis discus discus 
<400> 26 

cacacacaca cacaca 

<210> 27 
<211> 32 
<212> DNA 

<213> Haliotis discus discus 
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<400> 27 

cacccccccc cacacacaca cacacacacc cc 32 

<210> 28 
<211> 16 
<212> DNA 

<213> Haliotis discus discus 
<400> 28 

cacacacaca cagaga 16 

<210> 29 
<211> 52 
<212> DNA 

<213> Haliotis discus discus 
<400> 29 

cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca ca 52 

<210> 30 
<211> 50 
<212> DNA 

<213> Haliotis discus discus 
<400> 30 



cacacacaca cacacacaca cacacacaca cacacacaca cacacacaca 



50 



<210> 31 
<211> 17 
<212> DNA 

<213>Haliotis discus discus 
<400> 31 

cagcacacac acataca 

<210> 32 
<211> 55 
<212> DNA 

<213> Haliotis discus discus 
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17 



<400> 32 

caaacaaaca aacacacaca caaacacaca cacacacaca cacacacagg cacggcggcg 60 
R 61 



